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Rule 110 is a cellular automaton that performs repeated simultaneous updates of an infinite row of 
binary values. The values are updated in the following way: Os are changed to Is at all positions 
where the value to the right is a 1, while Is are changed to Os at all positions where the values to the 
left and right are both 1. Though trivial to define, the behavior exhibited by Rule 110 is surprisingly 
intricate, and in | l l| we showed that it is capable of emulating the activity of a Turing machine 
by encoding the Turing machine and its tape into a repeating left pattern, a central pattern, and a 
repeating right pattern, which Rule 110 then acts on. In this paper we provide an explicit compiler 
for converting a Turing machine into a Rule 110 initial state, and we present a general approach for 
proving that such constructions will work as intended. The simulation was originally assumed to 
require exponential time, but surprising results of Neary and Woods |2 1 have shown that in fact, only 
polynomial time is required. We use the methods of Neary and Woods to exhibit a direct simulation 
of a Turing machine by a tag system in polynomial time. 

1 Compiling a Turing machine into a Rule 110 State 

In this section we give a concrete algorithm for compiling a Turing machine and its tape into an initial 
state for Rule 1 10, following the construction given in UJ. We will create an initial state that will eventu- 
ally produce the bit sequence 01 101001 101000 if and only if the coiTcsponding Turing machine halts. Or, 
if one prefers time sequences to spatial sequences, it is also the case that the sequence 1 101010101 1 1 1 1 1 
will be produced over time by a single cell if and only if the Turing machine halts. (These sequences are 
the shortest that occur for the first time in the collision that produces the F glider as in figure [Tll z). which 
only occurs if the algorithm halts.) While based directly on the methods of [1|, the presentation of the al- 
gorithm here is self-contained. This section can be viewed as heavily commented high level pseudocode, 
the intent being to explicitly provide all the details a program would use, regardless of programming 
language or input or output format. 

Following 111, we will convert the Turing machine into a tag system, which we then convert into 
a cyclic tag system, and finally into a bit sequence for Rule 1 10. Of course, if one is starting with a 
tag system, or a cyclic tag system (whose appendants' lengths are multiples of 6), then the unnecessary 
conversions may be omitted. For example, Paul Chapman's tag system 131 for the 3x + \ problem, 
given hy {A'^C,B^D,C^ AE,D'^ BF,E-^ CCD,F ODD}, with a starting tape of C[DY-\ or 
Liesbeth De Mol's more recent tag system for the same problem f4], {A ^ CY,C A,Y -^AAA}, can 
be implemented nicely in Rule 1 10 without going through a Turing machine representation. As another 
example, the cyclic tag system {YYYYYY,<l),NNNNNN ,®}, starting with a single Y on the tape, yields 
exactly the same behavior as the cyclic tag system shown in figure 2 of lUl with runs of Ys (after the 
initial Y) doubled and runs of A'^s lengthened by a factor of six. This cyclic tag system is also shown on 
Page 96 of im, which shows a random-looking graph of its behavior during the first million steps. 

If we start with a cyclic tag system whose appendants are not each a multiple of six long, we can 
convert it into such a form as follows: Expand each appendant by adding 5 A'^ symbols after every symbol, 
and expand the list of appendants by adding 5 empty appendants after every appendant. The tape should 
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also be expanded just as each appendant was. This will make every appendant's length be a multiple of 
six, while performing the same computation, on every sixth step, as the original cyclic tag system. 



1.1 We start with a Turing machine 

Suppose we ai^e given a Turing machine with m states *P = {y\,W2t ■ ■ ^Wm} and t symbols 
r = {a\ , G2, . . . , af } and we are given its lookup tables for which symbol to write T(i//,-, Oj) G £, which 
way to move A(i//,-, Oj) € {left, right, halt}, and what state to go into r{Yh ^j) ^ and T do not need 
to be defined for (state, symbol) pairs that cause the machine to halt, i.e. for which A is halt.) 

Suppose further that the Turing machine is currently in state y/y, and that the tape is of the form 



where the box represents the current position of the Turing machine's head, and the portions with lines 
over them ai^e repeated into the distance of the two-way infinite tape. 

Our task is to convert this complete description of the Turing machine and its infinite tape into an 
initial state for Rule 110. 



1.2 We transform it into a tag system 

Our first transformation will be to convert this Turing machine into a set of tag system rules for a tag 
system with deletion number s, meaning that at each step, s symbols ai^e removed from the front of the 
tape, and an appendant is appended according only to the first of the s removed symbols. There are two 
general approaches for doing this. The traditional (since 1964) approach of Cocke and Minsky 161 is 
simpler, but results in an exponential slowdown in simulation time. The modern (since 2006) approach 
of Neary and Woods |l2j| is more complicated, but amazingly solves what I call the "geometry problem" 
of cyclic-state tape processors such as tag systems and cyclic tag systems. The geometry problem in this 
instance is that the processing head is unaware, as it scans the tape, of which tape symbols are next to 
which other tape symbols. This is because it effectively has a fixed number of bits of memory, consisting 
of the phase of the head with respect to the tape, and these bits can only be read or written with difficulty. 
One difficulty is that the bits must be shared for an entire pass of processing the tape, meaning that the 
machine is unable to simply remember at each step what symbol it just saw, as a Turing machine might 
do. Indeed, everything the machine does remember is due to the sum total effect of the previous pass, 
rather than the current pass. This makes it nearly impossible for the system to know the ordering of 
the symbols on the tape. Without being able to detect the order of the symbols, encodings are limited 
to a unary representation, which makes the tape exponentially larger than a binary representation, thus 
making passes over the tape take exponentially longer. 

In this section we will perform the transformation using the simpler method of Cocke and Minsky. 
For an explanation of why it is that a transformation like this results in a tag system that correctly 
emulates the Turing machine, see |1]. Here we just focus on the mechanics of the transformation itself. 
In section [3] we will show how a conversion like this can be done using the more complicated method of 
Neary and Woods. 

To enable our transformation, we will first add two symbols to the alphabet, Ot+\ and Of ^2, for a total 
of s = t + 2 symbols. These two new symbols will be used to mark the left and right ends of the central 
nonperiodic portion of the tape. We will not need to define T, A, or Y for Ot+\ or 0^+2- 
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Now we can transfomi the Turing machine into a set of tag system rules on an alphabet <I> of 4m + 3ms 
symbols: 

Hff,. , Ly. ,Ry., R^.^ for each / G { 1 . . . m} 1 
Hy_(jj , Ly^Gj , R\i/,(Jj for each / G { 1 . . . m} and j G { 1 . . . 5} J 
The tag system rules, one for each symbol in are based on T, A, and F as follows: 



















R\i/i 




R\ifiaiR\ifia2 ■ • 


■R\l/ia, 


Rxfij* 




[Rw^' 




'=left 




[^r(v/,-,f7;)*]'''" 





^\tiiaj\A(\ifi,aj)=halt ^ ® 

^Yi<yj\MVi,<yj)=right ^ [i-r{<i/i,aj)Y 
^\l/iaj\A(\i/i,c!j)=halt ® 

^w,oMiv„<yj)=left [^r(v^„fT,)]'' 
^Yi(jj\A{w,,aj}=right ^Hv.^aj) 
^\\iiaj\A(\i/i,c!j)=halt ^ ® 

The notation [ij^mZ^o/]" represents n consecutive copies of the symbol. Each symbol whose subscript is 
a pair (i//,-, Oj) that causes the Turing machine to halt leads to an empty right hand side, denoted by 0. If 
the Turing machine does not halt, these empty right hand sides will never be used. 

Additionally, we transform the Turing machine tape into the following tag system tape: 

This completes our transformation of the Turing machine into a tag system with deletion number s. 

If for some reason one wants to to avoid tag system rules that are exponentially long (in the length 
of the Turing machine's tape's periodicity), then instead of using one new symbol at the end of the tape 
to extend the tape by another period, one can use many new symbols, each of which extends the tape by 
a limited amount. Similarly, if one wants to avoid an initial tag system tape that is exponentially long 
(in the length of the initial Turing machine tape), new states can be added to the Turing machine for 
the sole purpose of writing the initial tape and positioning the Turing machine on it. Using these two 
methods, the entire compilation algorithm in this section takes only polynomial time in the size of the 
Turing machine's initial configuration, and creates a Rule 1 10 initial state of polynomial size. 
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1.3 We further transform it into a cyclic tag system 

Our next transformation will be to convert the tag system tape and rules into a tape and cyclic appendant 
sequence for a cyclic tag system. 

Cyclic tag systems were invented as part of the proof of Rule 1 lO's universality, but they are also 
interesting in their own right. For example, Neary and Woods introduced their method in [2'] by showing 
how a cyclic tag system can emulate a Turing machine. They simply cycle through a list of appendants 
as they read the tape, appending the current appendant to the end of the tape whenever a F is read. 

To begin this transformation, we assign an ordering to the tag system alphabet <I>: 0i = H^r^ , ^2 = 
H^,^, <p4m+3ms = ^v/„,o-, Ncxt, wc extend so that its size |<I>| becomes a multiple of 6, by adding 
anywhere from to 5 dummy rules to the tag system, of the form: ^4m+3ms+\ 0, <l>4m+3ms+2 '^0, • • • 

Now we can create the cyclic tag system by listing the tag system's rules in order, and converting 
each right hand side into a string of Fs and Ns via a simple unary encoding where each (pi becomes a 
string of \<t>\ Ns with the one changed to a Y: (pi ^ [N]'^'^ Y [A^]l*l"' 

So for example the first rule, H^^ ^v'iffi^Viff2 • • -^Vifi rewritten using the (/>;S as (pi 

^4m+\^4m+2- ■ -^Am+s whosc right hand side gets converted to 

^4m y ^|0|-4m-l ^4m+l y ^|0|-4m-2 ^4m+.v-l y ^|<J.|-4m-. happens tO simplify tO 

j^4m Y 7j'«-i j^j\<^\-4m~s ^^^^ jj^j^ jj^g ^j.^^ appendant in the cyclic tag system's cyclic list. 

The cyclic tag system's cyclic list starts with the |0| appendants that can be generated in this way 
from the rules of the tag system, and then it is extended to a length of s\^\ by simply adding {s — 1)|<J>| 
empty appendants. 

The initial tape for the cyclic tag system is simply the unary encoding of the tag system's tape into Fs 
and A'^s. This completes our conversion of the tag system into a cychc tag system with appendants. 

1.4 We finally convert it into a Rule 110 state 

Our final transformation will be to convert the cyclic tag system's tape and appendants into an initial 
state for Rule 1 10. We will do this by simply gluing together bit sequences for the various glider clusters 
involved. We will start with the central tape region, and then we will specify the periodic sequences to 
its right and left. 

We will start the central bit sequence with the row marked in figure [T] block C. We will first extend 
this row to the right by attaching other blocks. Each time we attach another block, we do it so that the 
zig-zag seam fits together perfectly, as if the two blocks were pieces of a large simple jigsaw puzzle, 
without worrying about whether the top or bottom edges of the blocks are aligned. Usually, the new 
block will not be able to be at exactly the same height as the previous block, but we simply need to make 
sure that the new block extends the t = row marked in block C. This row will be the initial state for 
Rule 110. Programmatically, "attaching the blocks" is a very simple process: As we extend the initial 
state, we just keep track of which phase of the zig-zag the t = row is at every time we cross from one 
block to another. This phase tells us which row to use from the new block, and then the phase of the 
zig-zag at the end of that same row in the new block becomes the phase for the next crossing. 

The central region is formed fairly directly from the cyclic tag system's tape: Each A'^ becomes ED, 
and each F becomes FID, but then the very last ID is changed to G, and C is stuck on the front. So for 
example, a tape of NNYN would become CEDEDFBEG. 

Next, we form the periodic right hand side with a periodic sequence of blocks based on the cyclic 
tag system's appendants. Each appendant from the cyclic tag system's list is converted by changing each 
F to II and each A'^ to IJ, but then the very first I is replaced with KM.. If there is no first I, due to 
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Figure 1: Blocks of bits used in generating the initial state for Rule 1 10. Two blocks are joined by simply 
fitting them together along the zig-zag edge so that the t = row, defined in block C, gets extended into 
the new block. Each block except for C is periodic: Blocks A and B repeat every 3 lines, and the other 
blocks repeat every 30 lines. The cyclic left hand side of Rule 1 lO's initial state is built with blocks A 
and B. The central region is built with blocks C - G. 
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Figure 2: More blocks of bits used in generating the initial state for Rule 110. The blocks in this figure 
are used to create the cyclic right hand side of Rule llO's initial state based on the cyclic tag system's 
appendant list. A program for creating the Rule 110 initial state would just store 30 strings of bits for 
each block, along with the vertical phase offset of the right hand zig-zag from the left hand one. Note 
that there is an E glider traveling right in the middle of every zig-zag region, so two blocks always join 
at an E. 
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the appendant being empty, then an L is used for that appendant. Once this is done for all appendants, 
the initial K of the first appendant is moved to the very end. So for example, the cyclic appendant list 
{YN, NYYN, 0, 0} would become MIIJIKIHJJffllOLLlK. This sequence of blocks gets repeated on the 
right hand side, and so the bit sequence for the t = row must also be periodic, since after some number 
of repetitions of the sequence of blocks, the bit sequence will enter the same row of the initial block as it 
did at the very beginning, and so the bit sequence becomes periodic at that point. 
The periodic left side has the form: [A]" B [A] '-^ B [A] ' ' B [A] jg 
where v = 76 • (the total number of Fs in all appendants) 
+ 80 • (the total number of A'^s in all appendants) 
+ 60 ■ (the number of nonempty appendants) 
+ 43 • (the number of empty appendants) 
Calculating the periodic sequence of bits for the t = row on the left works just like calculating the 
sequence on the right, except that we work our way to the left from the t = row in block C, crossing 
over from block C to block B, then twelve copies of block A, and so on. The bit sequence will have to 
go through the [A]''B [A]^^B [A]^'B [A]^^B block sequence three times before it starts repeating. 

This completes our algorithm for transforming an arbitrary Turing machine into an initial state for 
Rule 1 10 consisting of a periodic sequence of bits on the left, followed by a central sequence, followed 
by a periodic sequence on the right. 



1.5 Some comments on this algorithm 

The blocks in figures [T] and |2] correspond to the conceptual clusters of gliders used in the construction 
inm. 

For example, the BA'-'BA'^BA'^B assembly is an ossifier, containing an A'^ in each B, separated 
by pure ether in the As. 

The E block, including the £s at its seams, is a moving data N, while the F block is a moving data Y. 
The D block glues adjacent elements of moving data together. 

The remaining blocks each contain a glider cluster, extending from the first fully present glider to the 
E in the right seam, so only the E in the left seam is not a member of the cluster. The E in the left seam 
is simply the last E of the previous cluster, thus providing the position of the block's cluster in relation 
to the previous cluster. 

The H is a primary component, while I and J are standard components, differing only in the spacing 
from the previous component. The J uses more space, so that IJJ encodes an A'^ of table data, while II 
encodes a 7 of table data. 

The IC block is a raw leader, and the L block is a raw short leader. The G block is a prepared 
leader. There is no block shown for a prepared short leader, since our algorithm assumed that the first 
appendant would not be empty. If the first appendant is empty, we would need a block for a prepared 
short leader, to be used in place of the G. This block would be a modified form of L using exactly the 
same modification that turned the IK into the G. 

The calculation of v for the periodic left hand side corresponds to a conservatively large rough esti- 
mate of twice the total vertical height of all the table data, including both components and leaders. This 
is used to set the vertical spacing between ossifiers so that ossifiers do not hit tape data by mistake. This 
assumes that at least one nonempty appendant will be appended on each cycle through the appendants 
if the system is not halting. This will indeed be the case if the system has been compiled from a Turing 
machine as described above, since the transformation to a tag system specifically ensures this property. 
If the system was compiled directly from a tag system or cyclic tag system, then an appropriate value of v 
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can be chosen if a bound can be placed on the number of consecutively rejected or empty appendants. If 
there is no such bound, then it will not be possible for this construction to work with a periodic left hand 
side. In this case, a more complicated left hand side will work, where the value of v increases linearly 
with each ossifier, since the length of the tape can only increase Unearly with time, and clearly at any 
time the length of the tape constitutes a bound on the number of consecutively rejectable symbols. Note 
that if the entire tape is rejected before the next ossifier anives, then there will be a collision between a 
prepared leader and an ossifier, as opposed to an ossifier hitting tape data as in figure [TH z). 

The reader has probably noticed that we are encoding the tape as moving data. Although it might be 
more natural to encode it as tape data as described in d, encoding it as moving data gives us some nice 
computational simplifications: Only two slopes of glider appear in the initial Rule 1 10 state, each glider 
only needs to be positioned relative to its immediate neighbors, and the moving data does not need to be 
reversed from the cyclic tag system tape. 

The algorithm forced the tag system to have a size that is a multiple of six so that every appendant in 
the cyclic tag system would have a length that is a multiple of six. This is required so that the leader after 
a rejected appendant will hit the next symbol of tape data correctly (requiring the rejected appendant to 
have even length, as shown in figure fTTT y)). producing invisibles that indeed pass through the ossifiers 
(requiring the rejected appendant to have length a multiple of three, as shown in figure Oo)). 

A cyclic tag system appendant in this construction may have a length that is not a multiple of six 
only if the cyclic tag system will always append that appendant. Otherwise the construction will not 
work properly. The safest approach is clearly to always use appendants whose length is a multiple of six. 



1.6 Converting back to a Turing machine 

We can continue with our series of conversions by converting the Rule 1 10 initial condition back into an 
initial state for a Turing machine tape. The benefit obtained from this big cycle of conversions is that 
the final Turing machine can be very small, since it only has to emulate Rule 1 10. The program, on the 
other hand, will have become much larger after going through the extensive compilation process in this 
cycle of conversions. Here we will show some Turing machines that can emulate Rule 1 10, and give an 
example for each of how it would emulate the evolution of the standard ether pattern. Encoding the Rule 
1 10 inital state into an initial tape for these Turing machines is fairly direct and we will not discuss the 
details of these transformations. 

Figures[3]and|4]show the lookup tables for the Turing machines of |[T1. The captions give examples of 
initial tapes that allow the Turing machines to implement Rule 110 logic. Each machine emulates Rule 
1 10 by sweeping back and forth over an ever wider stretch of tape, with each rightward sweep computing 
one more step of Rule 1 lO's activity. 

More detailed initial tapes can be used to make the machines perform more detailed Rule 1 10 com- 
putations, for example by inserting gliders into a central ether region, or including gliders in the periodic 
portion. Note that the periodic patterns on the sides of the Turing machine's tape correspond to periodic 
paths through both space and time on the sides of the Rule 1 10 evolution. 

In contrast with the Turing machine of figure |3l the other three machines operate in such a way that 
the Rule 110 rows shift one cell to the right on the Turing machine tape with each simulated time step. 

More recently, Neary and Woods [7J have achieved even smaller Turing machines that similarly 
emulate Rule 110. 
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Figure 3: This 2 state 5 symbol Turing machine can be stalled in state Sq on the two-way infinite tape 
[0^010]° 



0^ 



[t^ 1 7^ 1 0^ 0^]°°, starting on the cell marked here with a box, and it will start computing Rule 
llO's ether pattern. This Turing machine is slightly modified from the one in |[T1 so that fewer transitions 
are needed (the unused one is marked with "X"). 
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Figure 4: The 3 state 4 symbol Turing machine can be started in state Sxo on the tape 



[0/,1lOr]°° Ol [OlIlIkOrOl]" to compute the ether pattern. The 4 state 3 symbol Turing machine can 
be started in state Sxo on the tape [BO 1]°°|b][B 1 1 1 1 1 OB] ~ to compute the ether pattern. The 7 state 
2 symbol Turing machine can be started in state Sn on the tape [1 1001 l]~[o][101 1010010]°°, and it will 
compute the ether pattern on every second cell of the tape. This machine was compressed down to seven 
states by David Eppstein HI. 
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2 Towards a formal proof approach 

The original exposition of the Rule 110 construction in m took a motivated approach to understanding 
how the construction could be made to work, by showing how some parts of the construction placed 
requirements on other parts of the construction, and then showing how the other parts had enough flexi- 
bility to satisfy those requirements. Only the inner workings of the components and leaders were treated 
as given without analysis. Of course, those could have been explained as well, as they were similarly de- 
signed by examining how to satisfy the relevant constraints, but this would have significantly lengthened 
the exposition. 

In the present paper we take the converse approach, treating the entire construction as handed to us 
in a completely specified form, as given in section [H for which all we have to do is check that as Rule 
1 10 acts on the initial state, all of the ensuing collisions will be of the right form. 

Each collision, whether between individual gliders or between clusters of gliders (which can be 
thought of as very large gliders), is completely determined by the spacing between its parts. In Rule 
1 lO's two dimensional space-time, spacings can be completely specified in terms of ^ distance ("over 
distance") and y distance ("up distance"), described in HI. 

Our general approach is that each time there is a collision, we can determine the spacing between its 
parts by examining the spacings among all of the previous collisions bordering the ether region above the 
collision in question. If every such region can be shown to lead to the proper collision, then by induction, 
all the collisions will be con^ect. 

All the collisions (at a glider cluster level) in this construction are between E material (leaders, table 
data, invisibles, moving data) on the right and either Cj material (tape data) or A material (ossifiers, 
acceptors, rejectors) on the left. 

The slopes of the A gliders and C gliders, are directly related to the measurements that we need. 
When C2 material hits E material, the correctness of the collision depends only on the ^ distance mod 
4. When A material hits E material, the correctness of the collision depends only on the y distance mod 
6. The values are only important mod 4 and mod 6 because of the periodicity of the E. 

For most of the collisions, some participants in the collision have emerged from crossing collisions. 
In these cases, we measure the relevant distance back at the point when those gliders were first created, 
before all the crossing collisions, since the crossing collisions do not affect the relative distances. Only 
collisions with acceptors and rejectors are never preceded by crossing collisions. 

This construction uses only the following collisions: 

• ossifiers hitting moving data or invisibles 

- Measurements of these colhsions are shown in figure [6] 

- The moving data is created with the correct spacing as shown in figure |7] 

- The invisibles are created with the correct spacing as shown in figure [8] 

• tape data passing through moving data or invisibles 

- Measurements of these colhsions are shown in figure [5] 

- The tape data is created with the correct spacing as shown in figure [6te). 

- The moving data and invisibles are created with the correct spacing as shown in figure [TOl 

• an element of tape data hitting a prepared leader 

- Measurements of this collision are shown in figure [TlTv). 

- The prepai^ed leader is created with the con^ect spacing as shown in figure [TTT w.x.v). 
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• an acceptor or rejector hitting table data 

- Measurements for these collisions are shown in figure I?] 

• an acceptor or rejector hitting a raw leader 

- Measurements for these collisions are shown in figure |9l 

We avoid an overly formal style in this presentation, but the general approach used here would be 
a good starting point for a complete formalization of the proof, a formalization of the sort that could 
be checked automatically by an automated proof checker. Most of the simpler spacing claims have 
essentially already been checked insofar- as simulations of this construction appear to work with no 
problems. But of course most of the more general claims that appear cannot be verified in their full 
generality so trivially. 

Variations on the Construction 

Other than the A'^ /s E reaction used for ossification, shown in figure Oe), there are two other reac- 
tions between an and an E that result in a C2. One of them, in which the £ is /f from the A^, is clearly 
unusable because it does not allow the ossification of figure [6te) to work, since the ^ distance between 
the new C2 and the E is one, not zero. The other, in which the E is from the A'^, does not have this 
problem. If it were used, then the leaders and primary components would need to have different internal 
arrangements. It turns out that, regardless of the details of these arrangements, the equation of invisibility 
after rejection, 4 + 1 + (2c — l)-5 + 2 + 5 + 5 = 0, from figure [8lm,o) and figure Og), which contains 
values that derive from nearly every aspect of the construction, would wind up being odd on one side and 
even on the other, and so the equation would be violated regardless of the value of c, and the construc- 
tion would not work, regardless of any restrictions on the lengths of appendants. The construction as 
originally designed in February of 1994 suffered from this problem, which was discovered and fixed in 
the ensuing weeks. Note that (5] is mistaken when it says on page 1115 that some mistakes in the proof 
were corrected in 1998. In fact, the basic construction, completed in 1994, leaves a lot of flexibility in 
choosing many of the spacings (even within the leaders and components, whose internal design is not 
discussed in IH), and what happened in 1998 was that a particular- set of aiTangements were chosen (for 
compactness) while writing a program to automatically generate a Rule 110 initial state con^esponding 
to an arbitrary cyclic tag system (see IH, page 1116). Finally, in 1999, a standardized set of methods of 
measurement and analysis were chosen during the writing of the exposition of the 1994 construction, the 
publication of which (in HI) was delayed until after the publication of HI. 

The astute reader may notice that primary components (as well as short leaders) could be avoided if 
the acceptor and rejector could be produced from where the leader currently produces them. The 
reader is invited to attempt this simplification, preferably without increasing the size of the construction. 

3 A Polynomial Time Simulation 

In im, Neary and Woods solved the geometry problem described here on page|32]by using the following 
idea, which we present in a form adapted to tag systems. (Another adaptation to tag systems is given in 
Neary 's thesis Q.) 

First of all, note that on a circular- tape, it is sufficient for the Turing machine head to always move to 
the right. For example, the natural approach to simulating a cellular automaton works in this way: The 
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C2C2C2C2 E E E E 




E C2 C2 E C2 

(c) id) 

Figure 5: An E crossing a €2- Vertical columns of dots represent the vertical columns of ether triangles 
used when measuring ^ distance, which we treat as a value mod 4 due to the periodicity of the E. 
Sequences of pai^allel short vertical columns of dots represent the mai^king of every fourth column of 
ether triangles for convenience of measurement, (a) Some basic measurements of the unique collision in 
which an E crosses a €2- Note that the three measurements shown logically imply that the regenerated 
C2 must be ^ from the regenerated E. (b) Putting together some measurements from (a), we see that 
two consecutive Es can both cross a C2 if and only if they are ^ from each other. Note that the spacing 
between the Es will be the same after the collisions as before the collisions, since each E undergoes an 
identical displacement in the collision, (c) Similarly, we see that consecutive C2S can both cross an E if 
and only if they are which is from each other. Note that they will still be A from each other after 
the collisions, since they each undergo the same displacement, (d) If we align several C2S and Es so that 
the first collisions are crossing collisions, then what will the remaining collisions be? After crossing the 
first E, the C2S are still ^ from each other, so the second E will also cross them all, and indeed, all of 
the Es will cross all of the C2S. 
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(g) (h) 

Figure 6: How an E can hit an A^. Vertical columns of dots are as in figure [5] Diagonal rows of dots 
represent the diagonal rows of ether triangles used when measuring y distance, which we ti^eat as a value 
mod 6 due to the periodicity of the E. Sequences of parallel short diagonal rows of dots represent the 
marking of every sixth row of ether triangles for convenience of measurement, (e) There are three ways 
an E can hit an to produce a C2, but only one works for the construction. It is the one where the E is 
/f from the A^. The resulting C2 is from the E that created it. (f) An E will cross an A^ if and only 
if it is /§ from the A^. The crossing causes a visually striking but irrelevant displacement (not indicated 
here) to the A^. The incoming A"^ is /f from the regenerated E. We see that consecutive A^s must be /f 
from each other if they are both to cross an E, and the construction always uses such a spacing between 
A^s. Every E will either pass through all the A'^s (we call such an E an "invisible"), or else it is "moving 
data" and will be converted ("ossified") into a C2 ("tape data") by the first A^ ("ossifier") it hits, as in 
(e). (g) After an invisible E, the distance to the next E determines whether it is another invisible or 
ossifiable moving data, (h) After a moving data E, the /" distance to the next E determines whether it 
is more moving data or an invisible. 
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Figure 7: Primary (pri) and standard (std) components will get processed correctly by either an acceptor 
(acc) or a rejector (rej). An acceptor converts components into moving data (md), whereas a rejector 
deletes components. The acceptor or rejector is produced by a prepared leader (prep) hitting a character 
of tape data (td), which produces a pair of invisibles (inv) as well as the acceptor or rejector. Note 
that lines in these diagrams represent clusters of parallel gliders, and the collisions between clusters are 
mai^ked with a circle representing the many collisions that occur where the clusters meet. A measurement 
to or from a cluster is made to or from the closest glider in the cluster, namely the one that touches the 
ether in which the measurement is being made. 
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td prep pri std raw 




Figure 8: Invisibles are the result of a prepared leader hitting an element of tape data, (m) The alignment 
of the invisibles {k + 5) depends on the alignment (k) of the prepared leader. The incoming invisible or 
moving data was originally produced as in (o) or (n). (n) When invisibles follow moving data, the leader 
was prepared by an acceptor, and the prepared leader's offset from the moving data isk = 0. As shown in 
(m), this gives the invisibles an alignment of ^ + 5 = 5, which will yield the correct interaction with the 
ossifiers, as shown in figure[6l;h). (o) When invisibles follow previous invisibles, the leader was prepared 
by a rejector. If c characters of table data were rejected, then k = 4 + l + (2c — l)-5 + 2 = 10c + 2. Since 
c is a multiple of 6, we get k = 2. In the special case c = 0, the previous leader was a short leader, and 
this figure does not apply, but we still get ^ = 2 as shown in figure |9tr). Either way, the invisibles are 
produced as in (m) with an alignment of /: + 5 = 1, yielding the correct collision with the ossifiers as 
shown in figure [6tg). 
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Figure 9: Raw leaders are placed from the previous component, which yields the correct alignment 
when prepared by either an acceptor (p) or a rejector (q). If there were no previous components, then 
the raw leader is placed from the previous short leader (r) (whose lower right gliders ai^e untouched 
by the preparation), again yielding the correct alignment for preparation. Raw leaders that come after 
short leaders yield prepared leaders aligned as in (r), with k = 2. This is the zero-component version of 
figure [Ho). Note that raw short leaders, due to the different position of their initial E, are always placed 
/+3 higher, as measured through the E"s, than the raw regular leaders shown here. 
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(«) 

Figure 10: Moving data and invisibles are created with the correct alignment to pass through tape data. 
The top collision in (s) is also the top collision in (t), but (t) applies even when a rejector is produced. 
The lower collision in (s) is also the top collision in (u), but (u) appUes also to later components, where 
the lower collision in (u) becomes the top collision in (u). (s) The first moving data after an invisible 
has correct alignment to pass through tape data, as in figure Ob), (t) An invisible always has correct 
alignment to pass thi^ough tape data, as in figure [5];a,c). (u) Moving data after previous moving data is 
also aligned correctly for passing through tape data, as in figure Ob). 
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Figure 1 1: A leader always gets prepai^ed, as in (w), (x), or (y), so that it will be positioned correctly, as 
in (v), to read an element of tape data. In (y), since c is a multiple of 6, the total '-^ distance mod 4 is 
l+0 + 3 + (2c-l)-(0 + 3)+0 = 6c+l = 1, as needed in (v). 
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C2 C2 C2 C2 




C2 F C2 C2 

(z) 

Figure 12: An F is produced if and only if the Turing machine halts. The tag system in section [T]is set 
up so that when the Turing machine halts, no more appendants are produced for the tape. This will cause 
ossifier A'^s to eventually hit a character of tape data, which leads to the production of an F as shown. 

Turing machine can store the local configuration of the tape in its finite state, and can read in the old 
configuration and write out the new configuration in a single pass. Extra cells can be written onto the 
tape at the wrap-around point as necessary, allowing it to grow. 

So, when the simulated Turing machine head is at a certain position on the tag system tape, what we 
would like to do is to find the next position on the tape to the right of the current position. The tag system 
can do this not in a single pass, but in a series of log(«) cycles, where each cycle consists of four passes. 
On the first cycle, it eliminates every second possibility. On the next cycle, it eliminates every second 
remaining possibility, and so on. After log(?i) cycles, the only remaining possibility is the first one: the 
next position on the tape. 

So that the system can know when log(n) cycles have occurred, it keeps on its tape a power of 2 that 
is larger than n, called the counter. It performs this same halving-of-possibilities process on the counter, 
where it is able to detect the end of the process since then for the first time there will be an odd number 
of possibilities remaining. 

When the tape grows in length past a power of two, then the counter must be doubled in size. This 
condition is detected by keeping track of a "growth flag" which is set at the beginning of each series of 
cycles, and then if the tape is ever noticed to have a non-power-of-two length, then the flag is reset. If 
the flag is still set at the end of the series of cycles, and the tape is being extended, then the counter size 
is doubled. 

In tag systems, one must take care that the counter behaves properly when the Turing machine "head" 
jumps across it, since then the counter gets processed twice before the tag system head is at the beginning 
of the tape again. 

In the tag system, we keep all sorts of information in the symbols, using a large alphabet, since the 
communication bandwidth between symbols is very low (a single bit or less per pass). For example, 
every symbol on the tape knows the cun^ent state of the Turing machine and the cun^ent stage of the 
simulation algorithm. We use six stages, and each stage does one pass over the tape. 
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We will use the following symbols: 

• For the head: H, h, P, Q 

P is used for a head remembering "A", Q for "B". 

• For the counter: UuXxVvYy 

X is an eliminated U, and V and Y are a form of U and X that represent the growth flag having 
been reset. 

• For the tape: AaBbCcDd 

A and B represent the two values used on the binary Turing machine tape. C and D are the 
eliminated forms of A and B. 

The lower-case letters are used when we need to be able to detect which of the two positions is being 
read or ignored. 

If the Turing machine has k states, then there are actually 6k symbols for each of the above symbols. 
We could write these for example as H4J being the H symbol that is used in stage 4 when the Turing 
machine is in state 7, but usually we will omit the indices, to avoid clutter. Not all of the 6k symbols are 
used, since not all of the letters are used in all stages. 

For clarity, we will also use "-" as a symbol, to be used in positions where we know it will not be 
read, and we will use "0" as a symbol with an empty appendant. These two symbols do not need any 
subscripts. 

Stages 

There is a cycle of four stages, with stage 4 going back to stage 1 until the first tape symbol has been 
isolated, at which point stage 4 goes on to stage 5 and then 6. Stage 6 performs the simulation of a step 
of the Turing machine and jumps back to stage 3 to start isolating the next tape symbol. 

We will start in stage 2, because it has the simplest form. At the front is Hh, then the tape is some 
combination of AA and BB, and somewhere there is a power of 2 of Uu's, at least as long as the A/B 
portion of the tape. 

We will explain the stages by following an example. 

Stage 2 

Example: HhAAUuUuUuUuBBAA (Turing machine tape is "A,B,A") 
Incoming parity: If reading second symbol of each pair, then the growth flag is unset. 
Outgoing parity: Start with first symbol of each pair. 

Main change: Cut number of U's in half, and check whether counter size should be doubled. 
Transfer to stage 3: 
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X * : Y Y 

V * : V 

Y * : Y Y 

Stage 3 

Example: H-AAUUUUBBAA 

Incoming parity: Start with first symbol of each pair. 

Outgoing parity: If reading second symbols, then there was just one U. 

Main change: Change odd U's (first, third, etc.) to UuXx, erase even U's. 

Transfer to stage 4: 
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d 






U 


* 


U 


u 


X 


X 


V 


* 


V 


V 


Y 


y 


X 


* 


X 


X 






Y 


* 


Y 


y 







stage 4 

Example: HhAaUuXxUuXxBbAa 

Incoming parity: If reading small letters, then first tape symbol is isolated, and we will go on to stage 5. 
Otherwise we go back to stage 1. 

Outgoing parity: Start with the first symbol of each pair. 
Main change to stage 1 : Change A to AaO, and B to BbO. 
Transfer to stage 1 : 

H * : H - 

A * : AaO 

B * : BbO 

C * : C C 

D * : D D 

U * : U U 

V * : V V 
X * : XX 

Y * : Y Y 

Stage 1 

Example: H-AaOUUXXUUXXBbOAaO 
Incoming parity: We read the H. 

Outgoing parity: If there were an even (resp. odd) number of A's and B's, then we will read the first 
(resp. second) symbol of each pair. 
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Main change: Every second A or B is turned into a C or D. 
Transfer to stage 2: 
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Stage 2 

Example: HliAAUuXxUuXxDDAA (read li) 
Stage 3 

Example: - H- AAVYYVYYDDAA (read H) 
Stage 4 

Example: HhAaVvYyYyYyDdAa (read H) 
Stage 1 

Example: H-AaOVVYYYYYYDDAaO (read H) 
Stage 2 

Example: HhAAVVYYYYYYDDCC (read H) 
Stage 3 

Example: H-AAVYYYYYYDDCC (read H) 
Stage 4 

Example: HhAaVvYyYyYyYyDdCc (read h) 

Main change to stage 5: H and U disappear. Isolated A/B becomes P/Q. C/D turn back into A/B. XfY 
becomes a mixed-stage form UXA'^Y The "head" has jumped from the H to the P/Q, so it has jumped 
onto the symbol it is reading. This may cause it to jump over the counter (the X'sAf's), which will be 
discussed below. 
Transfer to stage 5: 
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Note regarding the transfer: The x and y symbols are written with a stage 4 subscript! 
Stage 5 

Example: - P- VyVyVyVyB-A - (read P) 
Incoming parity: We read the P or Q. 

Outgoing parity: Read first symbols if P, read second symbols if Q. 

Main change: x and y "stage 4" symbols disappear- (more on them below), and parity is set to reflect P 
vs. Q. 

Transfer to stage 6: 
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Stage 6 

Example: P-VvVvVvVvBbAa (read P) 

Incoming parity: If P at front, will read first of each pair. If Q at front, will read second of each pair. 
Outgoing parity: Will read first of each pair. 

Main change: A step of the Turing machine is simulated. The state (invisible subscript) changes. The 
new symbols get written onto the tape. V's turn back into U's (i.e. the growth flag is reset). The counter 
size (number of U's) doubles if necessary. The counter size is also halved for direct entry to stage 3. 
Transfer to stage 3: 



P * : [A a B b] H - 

Q* : [AaBb]-H- 

A * : A A (new state when reading an A) 

a * : A A (new state when reading a B) 

B * : B B (new state when reading an A) 

b * : B B (new state when reading a B) 

U * : U [U] (new state when reading an A) 

u * : U [U] (new state when reading a B) 

V * : U (new state when reading an A) 

V * : U (new state when reading a B) 
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Notes regarding the transfer: The [A a B b] portions depends on what symbol(s) are being written, 
and they are written with subscripts for stage 6, and the current (old) state (and they will be the last thing 
processed at the end of this transfer), while the H (and everything below) gets written with the new state 
and stage 3. 

The [U] portions are included only when the coiTcsponding Turing machine transition writes 2 sym- 
bols. This is where the length of the counter gets doubled. 

Stage 3 

Example: H-UUUUBBAABBBB 
Stage 4 

Example: HhUuXxUuXxBbAaBbBb 
Stage 1 

Example: H-UUXXUUXXBbOAaOBbOBbO 
Stage 2 

Example: HhUuXxUuXxBBCCBBDD 
Stage 3 

Example: H-UXXUXXBBCCBBDD 
Stage 4 

Example: HhUuXxXxXxBbCcBbDd 
Stage 1 

Example: H-UUXXXXXXBbOCCBbODD 
Stage 2 

Example: HhUuXxXxXxBBCCDDDD 
Stage 3 

Example: H-UXXXXXXBBCCDDDD 
Stage 4 



Example: HhUuXxXxXxXxBbCcDdDd (read h) 

Now we will see what the mixed-stage stuff is doing, where the stage-4 "x" turns into a "U x" pair 
with the U in stage 5 but the x in stage 4. Recall that h and u both disappear. 
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Stage 4.5 

Example: UxUxUxUx-Q-A-B-B - (read x) 

Now we see that that the stage-5 U's will get ignored as stage 4 finishes by processing the counter a 
second time. 

Stage 5 

Example: - Q- A- B- B- UxUxUxUx 

And we see that this time the stage-4 x's will get ignored during the stage-5 processing. Stage 5 
doesn't do much because it is mostly serving as a signal to the counter regarding what stage is happening. 
The counter knows it is still stage 4 while the second symbols are getting read, and the counter knows 
stage 5 has arrived when the first symbols get read. So stage 5 has to have a fixed parity during its 
processing, making it a boring stage. 

Stage 6 

Example: QAaBbBbUuUuUuUu 

And so on... The counter will double in size on the next step if necessary. It works well in simulation. 

As mentioned in [ilOl . the fact that tag systems can compute efficiently means that all known small 
universal Turing machines work in polynomial time. 
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